Blogs Search Engine Using RSS Syndication and Fuzzy Parameters
Abstract
The rapid growth of the internet and of its user base creates a need for an intelligent search engine that narrows the search space on the World Wide Web (WWW) and returns the relevant information requested. To address both problems, this paper proposes a search engine designed and built specifically to search information contained in blogs, using RSS syndication and fuzzy parameters. The blog search engine consists of three main phases: 1) crawling using an RSS feeds algorithm; 2) indexing using a weblog indexing algorithm; and 3) searching using fuzzy logic. In the RSS crawling phase, RSS feeds are gathered and useful information is extracted from them, such as title, link, publication time, and description. Next, the indexing phase uses the links to retrieve the blog sites for text processing and to construct the indexing database. To retrieve the information a user requests, an interface supports blog search by keywords, each with an associated degree of importance. The density of each keyword is then computed from the indexing database, and the pages are ranked using a fuzzy weighted average. In the experiment, the system achieved a mean average precision of 81.7%.
Keywords: RSS feeds, blog search engine, fuzzy weighted average, keyword density.
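The three phases described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact formulas, so the sketch assumes that keyword density is the share of words in a text equal to the keyword, and it replaces the fuzzy weighted average with its crisp form, sum(w_i * d_i) / sum(w_i), where each weight w_i is the user's degree of importance in [0, 1]. The function names are hypothetical, and the feed's title and description stand in for the full blog page that the paper retrieves through the link.

```python
import re
import xml.etree.ElementTree as ET


def parse_rss_items(rss_xml):
    """Extract title, link, publication time, and description from RSS 2.0 items."""
    root = ET.fromstring(rss_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": item.findtext("pubDate", default=""),
            "description": item.findtext("description", default=""),
        })
    return items


def keyword_density(text, keyword):
    """Assumed definition: fraction of words in `text` equal to `keyword`."""
    words = re.findall(r"\w+", text.lower())
    if not words:
        return 0.0
    return words.count(keyword.lower()) / len(words)


def weighted_average_score(text, weighted_keywords):
    """Crisp weighted average of keyword densities (a simplification of
    the fuzzy weighted average). `weighted_keywords` maps each keyword
    to its degree of importance in [0, 1]."""
    total_weight = sum(weighted_keywords.values())
    if total_weight == 0:
        return 0.0
    weighted_sum = sum(
        weight * keyword_density(text, keyword)
        for keyword, weight in weighted_keywords.items()
    )
    return weighted_sum / total_weight


def rank_items(items, weighted_keywords):
    """Return (score, link) pairs sorted from most to least relevant."""
    scored = []
    for item in items:
        # Title plus description stands in for the indexed page text.
        text = item["title"] + " " + item["description"]
        scored.append((weighted_average_score(text, weighted_keywords), item["link"]))
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Calling `rank_items(parse_rss_items(feed_xml), {"fuzzy": 1.0, "rss": 0.5})` would rank the feed's entries for a query in which "fuzzy" matters twice as much as "rss". A full fuzzy weighted average would carry fuzzy numbers rather than single values through the same formula.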
Similar resources
CS229 Final Project: Clustering News Feeds with Flock
The rise of blogging and RSS (Really Simple Syndication) has created a more personalized way of reading the news, with a much richer diversity of information and perspectives. Unfortunately, though, the rapid growth of these technologies has created a new problem: how to handle the vast amount of information now available. Unfortunately, while RSS aggregators have helped to bring all of the inf...
RSS Feed Recommendation
Introduction: Really Simple Syndication (RSS) feeds allow users to access blogs and articles in an easy-to-read format. They cut out the overhead of navigating websites for content and let users get information more quickly. Currently, the user is in total control of their RSS feeds, adding and deleting feeds according to their tastes. This requires the user to actively search out RSS feed...
Memeta: A Framework for Multi-Relational Analytics on the Blogosphere
The “memeta” project is developing a framework for studying the structure and content of the blogosphere. We are particularly interested in how metadata about blogs can be discovered, extracted and computed, and how this metadata can be modeled, represented and analyzed to provide new blog related services. Weblogs, or blogs, are web sites consisting of dated entries (posts) typically organized...
RSS, OPML and Weblog Ecosystems: A Survey of New Technologies in Internet Publication
With the rapid growth of weblogs (or “blogs”) over the past year, users require a way of rapidly accessing recent content from many different websites. Traditional websites are inadequate for this, as their content and presentation information are inseparably intertwined. This paper describes the development of the RSS (Really Simple Syndication) specifications as a solution for this problem. T...
A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine
Background and Aim: The purpose of this study was to compare the impact of text-based indexing and folksonomy on image retrieval via the Google search engine. Methods: This study used an experimental method. The sample was 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...
Publication date: 2012